Comparative Study of Topic Segmentation Algorithms Based on Lexical Cohesion: Experimental Results on Arabic Language
نویسندگان
چکیده
صلاخلا ـ ة : ُّ دعتُ ل ثم ةيعيبطلا تاغللا ةجلاعم تاقيبطت نم ديدعلل ايساسأ انوكم ةيعوضوملا ةئزجتلا قيبطت تامولعملا عاجرتساو صوصنلا صيخلت . نم فدهلا وه ثحبلا اذ ه قت و مي ةيعوضوملا ة ئزجتلا تا يمزراوخ ة يلاعف ضوملا دودحلا ى لع فرعتلا يف ة يبرعلا صوصنلا لخاد ةيعو . م تو قايس لا اذ ه يف ة فلتخم رداصم نم ةيبرع صوصن ةسمخ لخاد اهنوظحلاي يتلا ةركفلا وأ عوضوملا تاريغت ىلع فرعتلل ةيبرعلا ةغللا ءارق نم ةعبس ءاعدتسا . فوسو مادختسا متي يف ةجتانلا ءارلآا قت و مي راهتشا رثآلأا نيتيمزراوخلل ةيبسنلا ةيلاعفلا يس ة يمزراوخ ا مهو لاأ ةيعوضوملا ةئزجتلا لاجم يف ا 99 ة يمزراوخو صنلا ديمرق ) تسآات نيليت ( سيياقم مادختساب كلذو قت و مي ءارآ ة قيرط ل ثم ةد يدج ىرخأ سييا قمو ةقدلاو عاجرتسلاا لثم ةفورعم ) ما كحأ ( ءار قلا . و ني بت وخلا نإ ف ة فيفطلا تانيس حتلا ضعب ءارجإ ب ه نأ ة يبيرجتلا جئا تنلا ةحلا ص حبص ت ة يزيلجنلإا صوص نلل ةيعو ضوملا ة ئزجتلا ي ف ةمدختس ملا تا يمزرا ةيبرعلا صوصنلا عم مادختسلال .
منابع مشابه
Enhancing lexical cohesion measure with confidence measures, semantic relations and language model interpolation for multimedia spoken content topic segmentation
Transcript-based topic segmentation of TV programs faces several difficulties arising from transcription errors, from the presence of potentially short segments and from the limited number of word repetitions to enforce lexical cohesion, i.e., lexical relations that exist within a text to provide a certain unity. To overcome these problems, we extend a probabilistic measure of lexical cohesion ...
متن کاملA probabilistic segment model combining lexical cohesion and disruption for topic segmentation (Un modèle segmental probabiliste combinant cohésion lexicale et rupture lexicale pour la segmentation thématique) [in French]
A probabilistic segment model combining lexical cohesion and disruption for topic segmentation Identifying topical structure in any text-like data is a challenging task. Most existing techniques rely either on maximizing a measure of the lexical cohesion or on detecting lexical disruptions. A novel method combining the two criteria so as to obtain the best trade-off between cohesion and disrupt...
متن کاملThe Role of Self-Regulatory Approach in Iranian Learners' Lexical Segmentation: The case of authentic materials
The present research investigated the effect of self-regulatory approach (with two components of self-checking and self-efficacy) on pre-intermediate Iranian learners' lexical segmentation in listening comprehension via authentic listening comprehension texts. To achieve this purpose, the investigators administered an Oxford Placement Test (2007) to ninety-eight students of two girls’ private j...
متن کاملThe Role of Self-Regulatory Approach in Iranian Learners' Lexical Segmentation: The case of authentic materials
The present research investigated the effect of self-regulatory approach (with two components of self-checking and self-efficacy) on pre-intermediate Iranian learners' lexical segmentation in listening comprehension via authentic listening comprehension texts. To achieve this purpose, the investigators administered an Oxford Placement Test (2007) to ninety-eight students of two girls’ private j...
متن کاملImproving Text Segmentation with Non-systematic Semantic Relation
Text segmentation is a fundamental problem in natural language processing, which has application in information retrieval, question answering, and text summarization. Almost previous works on unsupervised text segmentation are based on the assumption of lexical cohesion, which is indicated by relations between words in the two units of text. However, they only take into account the reiteration,...
متن کامل